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Abstract. Finite frame theory has a number of real- world applications. In applications like 
sparse signal processing, data transmission with robustness to erasures, and reconstruction 
without phase, there is a pressing need for deterministic constructions of frames with the 
following property: every size-M subcollection of the M-dimensional frame elements is 
a spanning set. Such frames are called full spark frames, and this paper provides new 
constructions using the discrete Fourier transform. Later, we prove that full spark Parseval 
frames are dense in the entire set of Parseval frames, meaning full spark frames are abundant, 
even if one imposes an additional tightness constraint. Finally, we prove that testing whether 
a given matrix is full spark is hard for NP under randomized polynomial-time reductions, 
indicating that deterministic full spark constructions are particularly significant because 
they guarantee a property which is otherwise difficult to check. 



1. Introduction 

A frame over a Hilbert space H is a collection of vectors {fi}i^x ^ "H with frame bounds 
< A < B < oo such that 

A\\xr<Y,\{^,h)\'<BM' VxG?/. (1) 

iex 

For a finite frame {fn}n=i M-dimensional Hilbert space Mm, the upper frame bound 

B which satisfies ([T]) trivially exists. In this finite-dimensional setting, the notion of being a 
frame solely depends on the lower frame bound A being strictly positive, which is equivalent 
to having the frame elements span the Hilbert space. 

In practice, frames are chiefiy used in two ways. The synthesis operator F: — )■ Mm of 
a finite frame F = {fn}n=i is defined by Fy := ^^=i2/H/n- As such, the M x N matrix 
representation F of the synthesis operator has the frame elements {fn}n=i ^s columns, with 
the natural identification Mm = C^. Note that here and throughout, we make no notational 
distinction between a frame F and its synthesis operator. The adjoint of the synthesis 
operator is called the analysis operator F* : Mm — ?• C^, defined by {F*x)[n] := {x,fn)- 
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Classically, frames are used to redundantly decompose signals, y = F*x, before synthe- 
sizing the corresponding frame coefficients, z = Fy = FF*x, and so the frame operator 
FF* : M.M — >■ EIm is often analyzed to determine how well this process preserves information 
about the original signal x. In particular, if the frame bounds are equal, the frame operator 
has the form FF* = AIm, and so signal reconstruction is rather painless: x = ^FF*x; in this 
case, the frame is called tight. Oftentimes, it is additionally desirable for the frame elements 
to have unit norm, in which case the frame is a unit norm frame. Moreover, the worst-case 
coherence between unit norm frame elements /i := maxj^j fj) \ satisfies /i^ > -^^j^z^, and 
equality is achieved precisely when the frame is tight with fj)\ = for all distinct pairs 
i,j G {1, . . . ,N} IH]; in this case, the frame is called an equiangular tight frame (ETF). 

The utility of unit norm tight frames and ETFs is commonly expressed in terms of a 
scenario in which frame coefficients {{F*x)[n\}^^-^ are transmitted over a noisy or lossy 
channel before reconstructing the signal: 

y = ViF*x), x = {FF*y'Fy, (2) 

where T>{-) represents the channel's random and not-necessarily-linear deformation process. 
Using an additive white Gaussian noise model, Goyal [23j established that, of all unit norm 
frames, unit norm tight frames minimize mean squared error in reconstruction. For the case 
of a lossy channel. Holmes and Paulsen [26] established that, of all tight frames, unit norm 
tight frames minimize worst-case error in reconstruction after one erasure, and that equian- 
gular tight frames minimize this error after two erasures. We note that the reconstruction 
process in ([2]), namely the application of (FF*)~^F, is inherently blind to the effect of the 
deformation process of the channel. This contrasts with Piischel and Kovacevic's more recent 
work [Sn], which describes an adaptive process for reconstruction after multiple erasures; we 
will return to this idea later. 

Another application of frames is sparse signal processing. This field concerns signal classes 
which are sparse in some basis. As an example, natural images tend to be nearly sparse in 
the wavelet basis [H] . Some applications have signal sparsity in the identity basis |32] . Given 
a signal x, let \Ef represent its sparsifying basis and consider 

y = F'i>*x + e, (3) 

where F is the M x N synthesis operator of a frame, '^*x has at most K nonzero entries, 
and e is some sort of noise. When given measurements y, one typically wishes to reconstruct 
the original vector x; viewing F as a sensing matrix, this process is commonly referred to 
as compressed sensing since we often take M <^ N. Note that y can be viewed as a noisy 
linear combination of a few unknown frame elements, and the goal of compressed sensing 
is to determine which frame elements are active in the combination, and further estimate 
the scalar multiples used in this combination. This problem setup is very related to that of 
sparse approximation, in which signals are known to be expressible as a sparse combination 
of elements from an overcomplete dictionary; in this case, the dictionary elements form the 
columns of F, and we again wish to determine the scalar multiples used in a given sparse 
combination. 
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In order to have any sort of inversion process for g, even in the noiseless case where e = 0, 
F must map i^'-sparse vectors injectively. That is, we need Fxi 7^ Fx2 for any distinct K- 
sparse vectors xi and X2- Subtraction then gives that 2if-sparse vectors hke xi — X2 cannot 
be in the nullspace of F. This identification led to the following definition |16j : 

Definition 1. The spark of a matrix F is the size of the smallest linearly dependent subset 
of columns, i.e., 

Spark(F) = min{||x||o : Fx = 0, x 7^ 0}. 

Using the above analysis, Donoho and Elad \l6l showed that a signal x with sparsity level 
< Spark(F)/2 is necessarily the unique sparsest solution to y = Fx, and furthermore, there 
exist signals x of sparsity level > Spark(F)/2 which are not the unique sparsest solution 
to y = Fx. This demonstrates that matrices with larger spark are naturally equipped to 
support signals with larger sparsity levels. One is naturally led to consider matrices that 
support the largest possible sparse signal class; we say an M x N matrix F is full spark if 
its spark is as large as possible, i.e., Spark(F) = M + 1. Equivalently, M x N full spark 
matrices have the property that every M x M submatrix is invertible; as such, a full spark 
matrix is necessarily full rank, and therefore a frame. 

That being said, while the submatrices of a full spark matrix will be invertible, they may 
not be well-conditioned. Moreover, the conditioning of these submatrices is an important 
feature for compressed sensing; Candes [8] gives an elegant proof that sensing matrices with 
well-conditioned submatrices, specifically, satisfying the restricted isometry property (RIP), 
allow for stable and efficient recovery. Unfortunately, for deterministic matrices, it is difficult 
to determine the conditioning of every submatrix; to date, no deterministic RIP matrix is 
known to perform optimally [I9], and in one respect, this difficulty is precisely the barrier 
to significant progress on the Kadison-Singer problem [13]. However, some work has been 
done to test whether most submatrices are well-conditioned |17j, and better yet, other work 
has managed to achieve RIP-like performance without requiring all submatrices to be well- 
conditioned [1]. Regardless, good sensing matrices for compressed sensing must necessarily 
have large spark to enable reconstruction [16], and so building such matrices could serve as 
one step toward optimal deterministic RIP matrices. 

In sparse signal processing, the specific application of full spark frames has been studied 
for some time. In 1997, Gorodnitsky and Rao [22j first considered full spark frames, referring 
to them as matrices with the unique representation property (for reasons discussed above). 
Since [22], the unique representation property has been explicitly used to find a variety 
of performance guarantees for sparse signal processing [HI [33], |50]. Tang and Nehorai [15] 
also obtain performance guarantees using full spark frames, but they refer to them as non- 
degenerate measurement matrices. 

Consider a matrix whose entries are independent continuous random variables. Intuitively, 
the matrix is full spark with probability one, and this fact was recently proved in [H]. However, 
this random process does not allow one to control certain features of the matrix, such as unit- 
norm tightness or equiangularity. Indeed, ETFs are notoriously difficult to construct, but 
they appear to be particularly well-suited for sparse signal processing [Il[T9l|32]. For example, 
Bajwa et al. [1] uses Steiner ETFs to recover the support of \E'*a; in (|3|; given measurements 
y, the largest entries in the back-projection F*y often coincide with the support of \E'*x. 
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However, Steiner ETFs have particularly small spark [20], and back-projection will correctly 
identify the support of x even when the corresponding columns in F are dependent; in this 
case, there is no way to estimate the nonzero entries of \E'*x. This illustrates one reason to 
build deterministic full spark frames: their submatrices are necessarily invertible, making it 
possible to estimate these nonzero entries. 

For another application of full spark frames, we return to the problem ^ of reconstructing 
a signal from distorted frame coefficients. Specifically, we consider Piischel and Kovacevic's 
work [36j, which focuses on an Internet-like channel that is prone to multitudes of erasures. 
In this context, they reconstruct the signal after first identifying which frame coefficients were 
not erased; with this information, the signal can be estimated provided the corresponding 
frame elements span. In this sense, full spark frames are maximally robust to erasures, as 
coined in [36]. In particular, an M x full spark frame is robust to — M erasures since 
any M of the frame coefficients will uniquely determine the original signal. 

Yet another application of full spark frames is phaseless reconstruction, which can be 
viewed in terms of a channel, as in (|2]); in this case, T'(-) is the entrywise absolute value 
function. Phaseless reconstruction has a number of real-world applications including speech 
processing [5], X-ray crystallography [lOj, and quantum state estimation ^39j. As such, there 
has been a lot of work to reconstruct an M-dimensional vector (up to an overall phase factor) 
from the magnitudes of its frame coefficients, most of which involves frames in operator space, 
which inherently require A^ = f2(M^) measurements |3l|39]. However, Balan et al. |4j show 
that if an M X A^ real frame F is full spark with A^ > 2M — 1, then T> o F* is injective, 
meaning an inversion process is possible with only A^ = 0(M) measurements. This result 
prompted an ongoing search for efficient phaseless reconstruction processes [H |10], but no 
reconstruction process can succeed without a good family of frames, such as full spark frames. 

Despite the fact that full spark frames have a multitude of applications, to date, there 
has not been much progress in constructing deterministic full spark frames, let alone full 
spark frames with additional desirable properties. A noteworthy exception is Piischel and 
Kovacevic's work in which real full spark tight frames are constructed using polynomial 
transforms. In the present paper, we start by investigating Vandermonde frames, harmonic 
frames, and modifications thereof. While the use of certain Vandermonde and harmonic 
frames as full spark frames is not new [6[ |9|, |2T] , the fruits of our investigation are new: For 
instance, we demonstrate that certain classes of ETFs are full spark, and we characterize 
the M X N full spark harmonic frames for which A^ is a prime power. The remainder of the 
paper proves two results which might be considered folklore-type observations — while their 
proofs are new, the results are not surprising. First, in Section 3, we show that M x N full 
spark Parseval frames form a dense subset of the entire collection of M x A^ Parseval frames. 
As such, full spark frames are abundant, even after imposing the additional condition of 
tightness. This result is balanced with Section 4, in which we prove that verifying whether a 
matrix is full spark is hard for NP under randomized polynomial-time reductions. In other 
words, assuming NP ^ BPP (a computational complexity assumption slightly stronger than 
P 7^ NP and nearly as widely believed), then there is no method by which one can efficiently 
test whether matrices are full spark. As such, the deterministic constructions in Section 2 
are significant in that they guarantee a property which is otherwise difficult to check. 
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2. Deterministic constructions of full spark frames 



A square matrix is invertible if and only if its determinant is nonzero, and in our quest 
for deterministic constructions of full spark frames, this characterization will reign supreme. 
One class of matrices has a particularly simple determinant formula: Vandermonde matrices. 
Specifically, Vandermonde matrices have the following form: 



V 



and square Vandermonde matrices, i.e., with 
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1 


1 ■ 
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OLN 
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■ "TV J 



(4) 



M, have the following determinant: 



det(V^)= W (5) 

l<i<j<M 

Consider Q in the case where > M. Since every M x M submatrix of V is also Van- 
dermonde, we can modify the indices in ([s]) to calculate the determinant of the submatrices. 
These determinants are nonzero precisely when the bases {an}n=i are distinct, yielding the 
following result: 

Lemma 2. A Vandermonde matrix is full spark if and only if its bases are distinct. 

To be clear, this result is not new. In fact, the full spark of Vandermonde matrices was first 
exploited by Fuchs [21j for sparse signal processing. Later, Bourguignon et al. [B] specifically 
used the full spark of Vandermonde matrices whose bases are sampled from the complex unit 
circle. Interestingly, when viewed in terms of frame theory, Vandermonde matrices naturally 
point to the discrete Fourier transform: 

Theorem 3. The only M x N Vandermonde matrices that are equal norm and tight have 
bases in the complex unit circle. Among these, the frames with the smallest worst-case 
coherence have bases that are equally spaced in the complex unit circle, provided N > 2M . 

Proof. Suppose a Vandermonde matrix is equal norm and tight. Note that a zero base will 
produce the zeroth identity basis element Sq. Letting V denote the indices of the nonzero 
bases, the fact that the matrix is full rank implies \V\ > M — 1. Also, equal norm gives that 
the frame element length 



M-1 



M-1 



Wfn 



a 



m\2 



is constant over n eV. Since V 



m=0 
M- 



m=0 



M-1 

E 

m=0 



a, 



1 2m 



1 ^2m jg strictly increasing over < x < cxd, there exists 
c > such that \an\^ = c for all n eV. Next, tightness gives that the rows have equal norm, 
implying that the first two rows have equal norm, i.e., \V\c = \V\(?. Thus c = 1, and so the 
nonzero bases are in the complex unit circle. Furthermore, since the zeroth and first rows 
have equal norm by tightness, we have \V\ = N, and so every base is in the complex unit 
circle. 



m=0 
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Figure 1. Plot of g defined by (|6| in the case where M = 5. Observe (i) 
that g is strictly decreasing on the interval (0, and (ii) that g{x) < gij^) 
for every x G {j^^, ^). As established in the proof of Theorem |3| g behaves in 



this manner for general values of M. 

Now consider the inner product between Vandermonde frame elements whose bases {e^'^*^"}^^]^ 
come from the complex unit circle: 

M-1 M-1 



m=0 



m=0 



We will show that the worst-case coherence comes from the two closest bases. Consider the 
following function: 



g{x) :-- 



M-1 
m=0 



2nixm 



(6) 



Figure [T] gives a plot of this function in the case where M = 5. We will prove two things 
about this function: 

(i) £9{x) < for every x G (0, 

(ii) 9{x) < g{^) for every x e 1 - 2^^)- 

First, we claim that (i) and (ii) are sufficient to prove our result. To establish this, we 
first show that the two closest bases e'^'^^^^' and e^'^*^"" satisfy |x„/ — Xn"\ < Without 
loss of generality, the n's are ordered in such a way that {xn}n=d — [0' l) nondecreasing. 
Define 

Xn+l-Xn, n = 0,...,N -2 

xo-{xn-i-1), n = N-l, 

and let n' be the n which minimizes d{xn, Xn+i)- Since the minimum is less than the average, 
we have 

If ^"^ \ 1 1 

d{Xn',Xn'+l) < ^ ((Xo - (xjV-l - 1)) + ^(^n+l - Xn) j = ^ < (7) 
^ n=0 ^ 

provided > 2M. Note that if we view {xn^n^z^^ &s members of M/Z, then d{xni Xn+i) — 
Xn+i — Xn- Since g{x) is even, then (i) implies that |(/„'+i, /n')P = 9{.Xn'-\-i — Xn') is larger 
than any other g{xp - Xp>) = \{fp, fpi)^^ in which Xp - Xpi G [0, 2^7] U [1 - i^^, 1). Next, ^ 



n+ly 



FULL SPARK FRAMES 



7 



and (ii) together imply that /n')P = g{xn'+i — Xn') > fi'(^) is larger than any other 

g{xp — Xpf) = \ {fp, /p')P in which Xp — Xp' G 1 — provided > 2M. Combined, (i) 
and (ii) give that | {fn'+i, fn') \ achieves the worst-case coherence of {/njneZjv- Additionally, (i) 
gives that the worst-case coherence is minimized when Xn'+i —Xn' is maximized, 

i.e., when the equally spaced in the unit interval. 

To prove (i), note that the geometric sum formula gives 



M-l 
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^2Mmx _ 










m=0 





2 - 2 cos(2M7rx) _ |^sin(M7rx 
2 -2cos(27rx) " 



sm vrx 



(8) 



where the final expression uses the identity 1— cos(22;) = 2 sin^ z. To show that g is decreasing 
over (0, 2^), note that the base of ([s]) is positive on this interval, and performing the quotient 
rule to calculate its derivative will produce a fraction whose denominator is nonnegative and 
whose numerator is given by 

Mvr sin(7rx) cos(M7rx) — vr sin(M7rx) cos(7rx). (9) 

This factor is zero at x = and has derivative: 

-(M^ - l)7r^ sin(7ra;) sin(M7rx), 

which is strictly negative for all x E (0, 2^). Hence, ^ is strictly negative whenever 
X G (0, 2Tj), and so g'{x) < for every x G (0 ^ 



1 



For (ii), note that for every x G (2^, 1 — 2!^), we can individually bound the numerator 
and denominator of what the geometric sum formula gives: 



g{^) 
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m=0 
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□ 



Consider the N x N discrete Fourier transform (DFT) matrix, scaled to have entries of 
unit modulus: 
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where u = e'^""'/^ . The first M rows of the DFT form a Vandermonde matrix of distinct 
bases {cu^I^Jq^; as such, this matrix is full spark by Lemma |2| In fact, the previous result 
says that this is in some sense an optimal Vandermonde frame, but this might not be the best 
way to pick rows from a DFT. Indeed, several choices of DFT rows could produce full spark 
frames, some with smaller coherence or other desirable properties, and so the remainder 
of this section focuses on full spark DFT submatrices. First, we note that not every DFT 
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submatrix is full spark. For example, consider the 4x4 DFT: 

1 1 1 " 

-i —1 i 
-1 1 -1 ■ 
i —1 —i 

Certainly, the zeroth and second rows of this matrix are not full spark, since the zeroth 
and second columns of this submatrix form the all-ones matrix, which is not invertible. So 
what can be said about the set of permissible row choices? The following result gives some 
necessary conditions on this set: 

Theorem 4. Take an N x N discrete Fourier transform matrix, and select the rows indexed 
by A4 C. Zjv to build the matrix F. If F is full spark, then so is the matrix built from rows 
indexed by 

(i) any translation of Ai, 

(ii) any AAi with A relatively prime to N , 

(iii) the complement of M. in Zjv- 

Proof. For (i), we first define D to be the N x N diagonal matrix whose diagonal entries arc 
{lu^}^~q. Note that, since (^("^+1)" = uj'^uf^'^ , translating the row indices by 1 corresponds 
to multiplying F on the right by D. For some set /C C Z^v of size M := |A^|, let Fyc denote 
the M X M submatrix of F whose columns are indexed by /C, and let D)c denote the M x M 
diagonal submatrix of D whose diagonal entries are indexed by /C. Then since Djc is unitary, 
we have 

|det((FD);c)| = \detiF,cDK)\ = |det(F;c)||dct(D;c)| = |det(F^)|. 

Thus, if F is full spark, \dct{{FD)ic)\ = |det(Fx:)| > 0, and so FD is also full spark. Using 
this fact inductively proves (i) for all translations of Ai . 

For (ii), let G denote the submatrix of rows indexed by AM.. Then for any set /C C Zjv 
of size M, 

det(G';c) = det{J^"'^'U^MM>c = det{cu^<^''\,eM,keic = det(F^;c). 

Since A is relatively prime to N, multiplication by A permutes the elements of Z^r, and so 
AJC has exactly M distinct elements. Thus, if F is full spark, then det(G'yc) = det(F^jc) 7^ 0, 
and so G is also full spark. 

For (iii), we let G be the {N — M) x N submatrix of rows indexed by M.^, so that 

= F*F + G*G. (10) 

We will use contraposition to show that F being full spark implies that G is also full spark. 
To this end, suppose G is not full spark. Then G has a collection of iV — M linearly dependent 
columns {gi}ieic, and so there exists a nontrivial sequence {ajjieAC such that 

^ aiQi = 0. 



1 
1 
1 
1 



NIn - [F* G*] 
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Considering Qi = G6i, where 6i is the ith identity basis element, we can use (10) to express 
this hnear dependence in terms of F: 

= G*0 = G*J2 (^i9^ = J2 (^iG*G6, = J2 (^^{NIN - F*F)Si. 
Rearranging then gives 

x:=nJ2 "i^i = (^iF*F5i. (11) 

Here, we note that x is nonzero since {ajjjgx: is nontrivial, and that x G Range(F*F). 
Furthermore, whenever j ^ /C, we have from ( pT| that 

{x,F*F6,) = = iv/F*F^aA,5A =ivV^a,5„5A =0, 

and so X ± Spa.n{F*F6j}j^ic'^. Thus, the containment Span{F*F(5j}jg/c<: ^ Range(F*F) is 
proper, and so 

M = Rank(F) = Rank(F*F) > Rank(F*F/cc) = Rank(F,cO- 

Since the M x M submatrix F/^c is rank-deficient, it is not invertible, and therefore F is not 
full spark. □ 

We note that our proof of (iii) above uses techniques from Cahill et al. [7|, and can be 
easily generalized to prove that the Naimark complement of a full spark tight frame is also 
full spark. Theorem |4] tells us quite a bit about the set of permissible choices for DFT rows. 
For example, not only can we pick the first M rows of the DFT to produce a full spark 
Vandermonde frame, but we can also pick any consecutive M rows, by Theorem |4]^i). We 
would like to completely characterize the choices that produce full spark harmonic frames. 
The following classical result does this in the case where is prime: 

Theorem 5 (Chebotarev, see |llj). Let N be prime. Then every square submatrix of the 
N X N discrete Fourier transform matrix is invertible. 

As an immediate consequence of Chebotarev's theorem, every choice of rows from the DFT 
produces a full spark harmonic frame, provided is prime. This application of Chebotarev's 
theorem was first used by Candes et al. [9J for sparse signal processing. Note that each of 
these frames are equal-norm and tight by construction. Harmonic frames can also be designed 
to have minimal coherence; Xia et al. |19] produces harmonic equiangular tight frames by 
selecting row indices which form a difference set in Z^r. Interestingly, most known families 
of difference sets in Zat require A^ to be prime ^7\, and so the corresponding harmonic 
equiangular tight frames are guaranteed to be full spark by Chebotarev's theorem. In the 
following, we use Chebotarev's theorem to demonstrate full spark for a class of frames which 
contains harmonic frames, namely, frames which arise from concatenating harmonic frames 
with any number of identity basis elements: 

Theorem 6 (cf. [IHl Theorem 1.1]). Let N be prime, and pick any M < N rows of the 
N X N discrete Fourier transform matrix to form the harmonic frame H . Next, pick any 
K < M, and take D to be the M x M diagonal matrix whose first K diagonal entries are 
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' N+K~M 



Y MN ^'^^ whose remaining M — K entries are y . Then concatenating DH with 
the first K identity basis elements produces an M x [N + K) full spark unit norm tight frame. 

As an example, when = 5 and K = 1, we can pick M = 3 rows of the 5x5 DFT which 
are indexed by {0, 1, 4}. In this case, D makes the entries of the first DFT row have size 



and the entries of the remaining rows have size y ^. Concatenating with the first identity 
basis element then produces an equiangular tight frame which is full spark: 



g -27ri/5 
g -27ri4/5 



g -27ri2/5 



g -27ri3/5 
g -27ri2/5 



g -27ri4/5 
g -27ri/5 



1 






;i2) 



Proof of Theorem Let F denote the resulting M x ( + K) frame. We start by verifying 
that F is unit norm. Certainly, the identity basis elements have unit norm. For the remaining 
frame elements, the modulus of each entry is determined by D, and so the norm squared of 
each frame element is 

To demonstrate that F is tight, it suffices to show that FF* = Im- The rows of DH 
are orthogonal since they are scaled rows of the DFT, while the rows of the identity portion 
are orthogonal because they have disjoint support. Thus, FF* is diagonal. Moreover, the 
norm squared of each of the first K rows is ^( ^"mat^^ ) + 1 = ^ti^ ■, while the norm squared 
of each of the remaining rows is N^ '^^tlf ) = ^t/^ , and so FF* 



N+K J 



M- 



MN I M ' " ^ ^ 

To demonstrate that F is full spark, first note that every M x M submatrix of DH is 
invertible since 

|det((M);c)| = |det(M^)| = |det(D)||det(i/yc)| > 0, 

by Chebotarev's theorem. Also, in the case where K = M, we note that the M x M 
submatrix of F composed solely of identity basis elements is trivially invertible. The only 
remaining case to check is when identity basis elements and columns of DH appear in the 
same M x M submatrix F/c- In this case, we may shuffle the rows of F/c to have the form 

'A 
B Ik] 

Since shuffling rows has no impact on the size of the determinant, we may further use a 
determinant identity on block matrices to get 

'A 



|det(F/ 



K) 



det 



B I 



K 



|det(A)det(Jx)| = I det (A) I . 



Since A is a multiple of a square submatrix of the N x N DFT, we are done by Chebotarev's 
theorem. □ 

As an example of Theorem |6j pick A^ to be a prime congruent to 1 mod 4, and select 
rows of the N x N DFT according to the index set M := {fc^ : k G I^n}- If we 
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take K = 1, the process in Theorem |6] produces an equiangular tight frame of redundancy 2, 
which can be verified using quadratic Gauss sums; in the case where N = 5, this construction 
produces ( |I2| ). Note that this corresponds to a special case of a construction in Zauner's 
thesis [51j, which was later studied by Renes [38] and Strohmer |13]. Theorem [6] says that 
this construction is full spark. 

Maximally sparse frames have recently become a subject of active research [121 EDI- 
note that when K = M, Theorem |6] produces a maximally sparse M x (A^ + K) full spark 
frame, having a total of M{M — 1) zero entries. To see that this sparsity level is maximal, 
we note that if the frame had any more zero entries, then at least one of the rows would 
have M zero entries, meaning the corresponding M x M submatrix would have a row of all 
zeros and hence a zero determinant. Similar ideas were studied previously by Nakamura and 
Masson [P]. 

Another interesting case is where K = M = N, i.e., when the frame constructed in 
Theorem |6] is a union of the unitary DFT and identity bases. Unions of orthonormal bases 
have received considerable attention in the context of sparse approximation [161 HZ]- In 
fact, when iV is a perfect square, concatenating the DFT with an identity basis forms the 
canonical example F of a dictionary with small spark [TU]. To be clear, the Dirac comb of 
a/ZV spikes is an eigenvector of the DFT, and so concatenating this comb with the negative of 
its Fourier transform produces a 2A/iV-sparse vector in the nuUspace of F. In stark contrast, 
when is prime. Theorem |6| shows that F is full spark. 

The vast implications of Chebotarev's theorem leads one to wonder whether the result 
admits any interesting generalization. In this direction, Candes et al. [9] note that any 
such generalization must somehow account for the nontrivial subgroups of Z^v which are not 
present when N is prime. Certainly, if one could characterize the full spark submatrices of a 
general DFT, this would provide ample freedom to optimize full spark frames for additional 
considerations. While we do not have a characterization for the general case, we do have one 
for the case where iV is a prime power. Before stating the result, we require a definition: 

Definition 7. We say a subset M. 'Z'Ln is uniformly distributed over the divisors of if, 

for every divisor d of N, the d cosets of {d) partition M. into subsets, each of size [^^J or 

At first glance, this definition may seem rather unnatural, but we will discover some 
important properties of uniformly distributed rows from the DFT. As an example, we take 
a short detour by considering the restricted isometry property (RIP), which has received 
considerable attention recently for its use in compressed sensing. We say a matrix F is 
{K, 5)-RIP if 

(1 - 6)\\xf < \\Fxf < (1 + S)\\xf whenever ||x||o < K. 

Candes and Tao [H] demonstrated that the sparsest '^*x which satisfies ^ can be found 
using £i-minimization, provided F is {2K, \/2 — 1)-RIP. Later, Rudelson and Vershynin [ID] 
showed that a matrix of random rows from a DFT and normalized columns is RIP with high 
probability. We will show that harmonic frames satisfy RIP only if the selected row indices 
are nearly uniformly distributed over sufficiently small divisors of A^. 
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To this end, recall that for any divisor d of A^, the Fourier transform of the d-sparse 
normalized Dirac comb -^X{^) is the ^-sparse normalized Dirac comb \J~fjX{d)- Let F be 
the N X N unitary DFT, and let H be the harmonic frame which arises from selecting rows 
of F indexed by Ai and then normalizing the columns. In order for H to be {K, (5)-RIP, 
Ai must contain at least one member of (d) for every divisor d of N which is < K, since 
otherwise 



which violates the lower RIP bound at a; = fact, the RIP bounds indicate that 



2 



cannot be more than 6 away from = 1. Similarly, taking x to be -^X(i^) niodulated by 
a, i.e., x[n] := 4^X/iV\He^™"/^ for every n G Z^r, gives that H-f/'xp = r^nlM fl (a + {d))\ 



is also no more than 6 away from 1. This observation gives the following result: 

Theorem 8. Select rows indexed by M. <^ Tj]^ from the N x N discrete Fourier transform 
matrix and then normalize the columns to produce the harmonic frame H. Then H satisfies 
the {K, 6) -restricted isometry property only if 



\Mn{a+ {d)) \ - 



\Mi 

d 



for every divisor d of N with d < K and every a = 0, . . . , — 1. 

Now that we have an intuition for uniform distribution in terms of modulated Dirac combs 
and RIP, we take this condition to the extreme by considering uniform distribution over all 
divisors. Doing so produces a complete characterization of full spark harmonic frames when 

is a prime power: 

Theorem 9. Let N he a prime power, and select rows indexed by Ai C Zjv from the N x N 
discrete Fourier transform matrix to build the submatrix F. Then F is full spark if and only 
if M. is uniformly distributed over the divisors of N. 

Note that, perhaps surprisingly, an index set Ai can be uniformly distributed over p but 
not over p^, and vice versa. For example, Ai = {0, 1,4} is uniformly distributed over 2 but 
not 4, while Ai = {0, 2} is uniformly distributed over 4 but not 2. 

Since the first M rows of a DFT form a full spark Vandermonde matrix, let's check that 
this index set is uniformly distributed over the divisors of N. For each divisor d of N, we 
partition the first M indices into the d cosets of (d) . Write M = qd + r with < r < rf. The 
first qd of the M indices are distributed equally amongst all d cosets, and then the remaining 
r indices are distributed equally amongst the first r cosets. Overall, the first r cosets contain 
q + 1 = [^J + 1 indices, while the remaining d — r cosets have q = [^J indices; thus, the 
first M indices are indeed uniformly distributed over the divisors of N. Also, when N is 
prime, every subset of Zjv is uniformly distributed over the divisors of A^ in a trivial sense. 
In fact, Chebotarev's theorem follows immediately from Theorem |9| In some ways, portions 
of our proof of Theorem [9] mirror recurring ideas in the existing proofs of Chebotarev's 
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theorem [13 [ISl IHl US]- For the sake of completeness, we provide the full argument and 
save the reader from having to parse portions of proofs from multiple references. We start 
with the following lemmas, whose proofs are based on the proofs of Lemmas 1.2 and 1.3 
in 1461. 



Lemma 10. Let N be a power of some prime p, and let P{zi, . . . , zm) be a polynomial with 
integer coefficients. Suppose there exists Nth roots of unity {(jJm\m=i ■^'^^c/i that P{uji, . . . , um) - 
0. Then P{1, . . . , 1) is a multiple of p. 

Proof. Denoting u := e~^'^*/^, then for every m = 1, . . . , M, we have Um = w'^™ for some 
< km < N. Defining the polynomial Q{z) := P{z^'^ z^ we have Q{uj) = by 
assumption. Also, Q{z) is a polynomial with integer coefficients, and so it must be divisible 
by the minimal polynomial of namely, the cyclotomic polynomial $Ar(z). Evaluating both 
polynomials at 2; = 1 then gives that p = $Ar(l) divides Q{1) = P{1, . . . , 1). □ 

Lemma 11. Let N be a power of some prime p, and pick Ai = {mi}f^^ ^ Zat such that 

n 

l<i<j<M 



[nij - rrii ) 



(13) 



M-l 
m=0 

is not a multiple of p. Then the rows indexed by Ai in the N x N discrete Fourier transform 
form a full spark frame. 



Proof. We wish to show that det{ul^)meM,i<n<M 7^ for all M-tuples of distinct A^th roots of 
unity {un}n=i- Define the polynomial D{zi, . . . ,zm) '■= det{zl^)m£M,i<n<M ■ Since columns 
i and j of (-2™)meyVi,i<n<Af are identical whenever Zi = Zj, we know that D vanishes in each 
of these instances, and so we can factor: 



D{zi,...,Zm) = Pizi,...,ZM) Yl ' 

l<i<j<M 



for some polynomial P{zi, . . . , zm) with integer coefficients. By Lemma 10, it suffices to 
show that P(l, . . . , 1) is not a multiple of p, since this implies -0(^1, . . . , um) is nonzero for 
all M-tuples of distinct A^th roots of unity {cOn}n=i- 
To this end, we proceed by considering 



dVf dV ( d 



(14) 

Zl=--- = ZM = l 



To compute A, we note that each application of Zj-^ produces terms according to the 
product rule. For some terms, a linear factor of the form Zj — Zi or Zi — Zj is replaced by 
Zj or — Zj, respectively. For each the other terms, these linear factors are untouched, while 
another factor, such as P{zi., . . . , zm)i is differentiated and multiplied by Zj. Note that there 
are a total of M{M — l)/2 linear factors, and only M{M — l)/2 differentiation operators to 
apply. Thus, after expanding every product rule, there will be two types of terms: terms in 
which every differentiation operator was applied to a linear factor, and terms which have at 
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least one linear factor remaining untouched. When we evaluate at zi = ■ ■ ■ = zjv/ = 1? the 
terms with linear factors vanish, and so the only terms which remain came from applying 
every differentiation operator to a linear factor. Furthermore, each of these terms before the 



evaluation is of the form P{zi, . . . , zm) Yl 



l<i<j<M ^ji 



and so evaluation at zi 



produces a sum of terms of the form P(l, . . . , 1); to determine the value of A, it remains 
to count these terms. The M — 1 copies of zm-^^^ can only be applied to linear factors of 
the form zm — Zi, of which there are M — 1, and so there are a total of (M — 1)! ways to 
distribute these operators. Similarly, there are (M — 2)! ways to distribute the M — 2 copies 
of amongst the M — 2 linear factors of the form Zm-i — ^i- Continuing in this 

manner produces an expression for A: 

A={M - 1)!(M - 2)! • • • 1!0! P(l, . . . , 1). (15) 

For an alternate expression of A, we substitute the definition of D(zi, . . . ,zm) into ( |I4| ). 
Here, we exploit the multilinearity of the determinant and the fact that {zn-^)z'^ 
to get 

A = det(m""^)mgA^_i<„<M = Y\. ~ "^*)' 

l<i<j<M 

where the final equality uses the fact that {jn^~^)rn<^M,i<n<M is the transpose of a Vander- 
monde matrix. Equating ( 15 ) to ( 16 ) reveals that ( 13 ) is an expression for P(l, . . . , 1). Thus, 
by assumption, P(l, . . . , 1) is not a multiple of p, and so we are done. □ 



(16) 



Proof of Theorem^ (<^=) We will use Lemma 11 to demonstrate that F is full spark. To 



apply this lemma, we need to establish that ( 13 ) is not a multiple of and to do this, we 



will show that there are as many p-divisors in the numerator of (13) as there are in the 



denominator. We start by counting the p-divisors of the denominator: 



M-1 m 



M-1 M~l 



nm!=nn^=nn 



m=0 



m=l 1=1 



e=l m=l 



(17) 



For each pair of integers k,a > 1, there are max{M — ap , 0} factors in (17) of the form 



i = ap . By adding these, we count each factor i as many times as it can be expressed as a 
multiple of a power of p, which equals the number of p-divisors in i. Thus, the number of 



p-divisors of (17) is 



ap 



k=l a=l 



Next, we count the p-divisors of the numerator of ( 13 ). To do this, we use the fact that Ai 



is uniformly distributed over the divisors of N. Since is a power of p, the only divisors of 
are smaller powers of p. Also, the cosets of (p^) partition Ai into subsets Sk,b '■= {f^i = b 
mod p^}. We note that mj — mj is a multiple of p^ precisely when rrii and mj belong to 
the same subset 5*^,6 for some < b < p''. To count p-divisors, we again count each factor 
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rrij — rrii as many times as it can be expressed as a multiple of a prime power: 

E Eft')- («) 

k=l fe=0 ^ ^ 

Write M = qp^ + r with < r < p'^. Then q = [^J . Since Ai is uniformly distributed over 

p*^, there are r subsets S^^b with g + 1 elements and p'' — r subsets with q elements. We use 
this to get 

E ('1'') = C 1 ' + (2) " ^) = - + + - ^^'0 • 

Rearranging and substituting M = qp^ + r then gives 



15\ 



^ (l^^'^l) = |(2M - (g + 1)/) = Mq - + = - a/) 



Thus, there are as many p-divisors in the numerator (19) as there are in the denominator 



(18), and so (13) is not divisible by p. Lemma 11 therefore gives that F is full spark. 

(^) We will prove that this direction holds regardless of whether is a prime power. 
Suppose C Ztv is not uniformly distributed over the divisors of N. Then there exists 
a divisor of such that one of the cosets of {d) intersects Ai with < [^J — 1 or > 
[^1 + 1 indices. Notice that if a coset of (d) intersects Ai with < [^J — 1 indices, then 

the complement Ai'^ intersects the same coset with > + 1 = + 1 indices. By 

Theorem |4|^iii), Ai produces a full spark harmonic frame precisely when Ai'^ produces a full 
spark harmonic frame, and so we may assume without loss of generality that there exists a 
coset of {d) which intersects Ai with > \^~\ + 1 indices. 

To prove that the rows with indices in Ai are not full spark, we find column entries which 
produce a singular submatrix. Writing M = qd + r with < r < c?, let /C contain q = [^J 
cosets of (^) along with r elements from an additional coset. We claim that the DFT 
submatrix with row entries Ai and column entries /C is singular. To see this, shuffle the rows 
and columns to form a matrix A in which the row entries are grouped into common cosets 
of {d) and the column entries are grouped into common cosets of (^). This breaks A into 
rank-1 submatrices: each pair of cosets a + (d) and b + {^■) produces a submatrix 



7V\ 
d 

/, ,(a+id)(b+j ^)\ , ,ab/, Mi, ,a^j 



for some index sets X and J\ this is a rank-1 outer product. Let L be the largest intersection 
between AA and a coset of (rf). Then |£| > [^] + 1 is the number of rows in the tallest of 
these rank-1 submatrices. Define A^ to be the M x M matrix with entries ^^[i, j] = j] 
whenever i e £ and zero otherwise. Then 

Rank(A) = Rank(A£ + A- Ac)< Ran^Ac) + Rank(A - Ac). (20) 

Since A — Ac has |£| rows of zero entries, we also have 

Rank{A- Ac) < M -\C\ < M - (If] +1). (21) 
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Moreover, since we can decompose Ac into a sum of \^~\ zero-padded rank-1 submatrices, we 



have Rank(A£) < [^] . Combining this with (20) and (21 ) then gives that Rank(y4) < M 



and so the DFT submatrix is not invertible. □ 

Note that our proof of Theorem [9] estabhshes the necessity of having row indices uniformly 
distributed over the divisors of in the general case. This leaves some hope for completely 
characterizing full spark harmonic frames. Naturally, one might suspect that the uniform 
distribution condition is sufficient in general, but this suspicion fails when = 10. Indeed, 
the following DFT submatrix is singular despite the row indices being uniformly distributed 
over the divisors of 10: 

/■ ^—2TTimn/10\ 

JmG{0, 1,3,4} ,ne{0, 1,2,6}- 

Also, just as we used Chebotarev's theorem to analyze the harmonic equiangular tight 
frames from Xia et al. we can also use Theorem [9] to determine whether harmonic 
equiangular tight frames with a prime power number of frame elements are full spark. Un- 
fortunately, none of the infinite families in |19] have the number of frame elements in the 
form of a prime power (other than primes). Luckily, there is at least one instance in which 
the number of frame elements happens to be a prime power: the harmonic frames that arise 
from Singer difference sets have M = ^fip and = for a prime power q and an 

integer d > 2; when g = 3 and d = A, the number of frame elements N = 11^ is a prime 
power. In this case, the row indices we select are 

M ={1, 2, 3, 6, 7, 9, 11, 18, 20, 21, 25, 27, 33, 34, 38, 41, 44, 47, 53, 54, 55, 56, 
58, 59, 60, 63, 64, 68, 70, 71, 75, 81, 83, 89, 92, 99, 100, 102, 104, 114}, 

but these are not uniformly distributed over 11, and so the corresponding harmonic frame 
is not full spark by Theorem [9j 



3. Full spark Parseval frames are dense 

Recently, Lu and Do [29] showed that full spark frames are dense in the entire set of 
matrices. This corresponds to our intuition that a matrix whose entries are independent 
continuous random variables is full spark with probability one, which was also recently 
proved by Blumensath and Davies [S]. By contrast, as noted by Gorodnitsky and Rao [22], 
certain classes of frames which arise in practice, such as in physical tomography, are never full 
spark. This issue also occurs in frame theory: Steiner ETFs form one of the largest known 
classes of ETFs, and yet none of them are full spark [20]. As such, Bourguignon et al. [^ were 
prompted to prove that, among all Vandermonde frames with bases in the complex unit circle, 
full spark frames are dense. In this section, we consider a very important class of frames, 
namely, those which exhibit Parseval tightness, where the frame bound is 1. Specifically, we 
show that full spark Parseval frames are dense in the entire set of Parseval frames. Unlike 
the previous work in this vein, our techniques exploit general concepts in algebraic geometry, 
lending themselves to future application in proving further density results. 

In order to make our arguments rigorous, we view each entry Fmn of the M x N matrix 
F in terms of its real and imaginary parts: Xmn + Wmn] this decomposition will become 
helpful later when we consider inner products between rows of F, which are not algebraic 
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operations on complex vectors. Recall that F is full spark precisely when each M x M 
submatrix has nonzero determinant. We note that for each submatrix, the determinant is a 
polynomial in the XmnS and i/mnS, and having this polynomial be nonzero is equivalent to 
having either its real or imaginary part be nonzero. This naturally leads us to the following 
definition from algebraic geometry: A real algebraic variety is the set of common zeros of a 
finite set of polynomials, that is, given polynomials pi, . . . ,pr G R[xi, . . . , Xk], we define the 
corresponding real algebraic variety by 

V{pi, ...,pr):={xeR'': pi{x) = ■■■= pr{x) = 0}. 

Each submatrix determinant corresponds to a real algebraic variety V C M^^^ of two 
polynomials, and having this determinant be nonzero is equivalent to restricting to the 
complement of V. 

In general, a variety is equipped with a topology known as the Zariski topology, in which 
subvarieties are the closed sets. As such, the set of matrices with a nonzero determinant 
forms a Zariski-open set, since it is the complement of a variety. In order to exploit this 
Zariski-openness, we require the additional concept of irreducibility: A variety is said to be 
irreducible if it cannot be written as a finite union of proper subvarieties. As an example, 
the entire space M'^ is the variety which corresponds to the zero polynomial; in this case, 
every proper subvariety is lower- dimensional, and so M.^ is trivially irreducible. On the other 
hand, the variety in defined hj xy = is not irreducible because it can be expressed as 
the union of varieties defined by x = and y = 0. Irreducibility is important because it says 
something about Zariski-open sets: 

Theorem 12. // V is an irreducible algebraic variety, then every nonempty Zariski-open 
subset of V is dense in V in the standard topology. 

For example, the hyperplane defined by xi = is a subvariety of M^, and since M*' is 
irreducible, the complement of the hyperplane is dense in the standard topology. Going 
back to the variety defined by = 0, we know it can be expressed as a union of proper 
subvarieties, namely the x- and y-axes, and complementing one gives a subset of the other, 
neither of which is dense in the entire variety. 

We are now ready to prove the following result: 

Theorem 13. Every matrix is arbitrarily close to a full spark frame. 

Proof. In an M X iV matrix, there are (^) submatrices of size M x M, and the determinant 
of each of these submatrices corresponds to a variety of two real polynomials. Since the set 
of M X full spark frames is defined as the (finite) intersection of the complements of these 
varieties, it is a Zariski-open subset of the irreducible variety M^^^ of all M x A^ matrices. 
Moreover, this set is nonempty since it contains the matrix formed by the first M rows of 



the N X N DFT, and so we are done by Theorem 12 □ 

We now focus on the set of Parseval frames, that is, tight frames with frame bound 1. 
Note that M x N Parseval frames are characterized by the rows forming an orthonormal 
system of size M in A^-dimensional space. The set of all such orthonormal systems is known 
as the Stiefel manifold, denoted St(M, A^). In general, a manifold is a set of vectors with 
a well-defined tangent space at every point in the set. Since this manifold property is nice. 
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we would like to think of varieties as manifolds, but there exist varieties with points at 
which tangent spaces are not well-defined; such points are called singularities. Note that our 
definition of singularity is geometric, i.e., where the variety fails to be a real differentiable 
manifold, as opposed to algebraic. For example, the variety defined hj xy = is the union 
of the X- and y-axes, and as such, has a singularity at x = y = 0. Certainly with different 
types of varieties, there are other types of singularities which may arise, but there are also 
many varieties which do not have singularities at all, and we call these varieties nonsingular. 
Nonsingularity is a useful property for the following reason: 

Theorem 14. An algebraic variety which is nonsingular and connected is necessarily irre- 
ducible. 

This result follows from Theorem 1.5.1 and Remark III. 7.9.1 of Hartshorne [25j, which 
assumes that the variety is over an algebraically closed field, unlike M; however, the proof is 
unaffected when removing the algebraically-closed assumption. 

Note that the orthonormality conditions which characterize Parseval frames F can be 
expressed as polynomial equations in the real and imaginary parts of the entries Fmn- As 
such, we may view the Stiefel manifold as a real algebraic variety; this variety is nonsingular 
because it is a manifold. Moreover, the variety is connected because the (connected) unitary 



group U(A^) acts transitively on St(M, A^). Thus by Theorem 14, the variety of Parseval 
frames is irreducible. Having established this, we can now prove the following result: 

Theorem 15. Every Parseval frame is arbitrarily close to a full spark Parseval frame. 

Proof. Proceeding as in the proof of Theorem [13} we have that the set of M x full spark 
Parseval frames is Zariski-open in the irreducible variety of all M x iV Parseval frames. Again 
considering the first M rows of the N x N DFT, we know this set is nonempty, and so we 



are done by Theorem 12 □ 



We note that Theorems 13 and 15 are also true when we further require the frames to 



be real. In this case, we cannot use the first M rows of the N x N DFT to establish that 



the Zariski-open sets are nonempty. For the real version of Theorem 13, we use an M x 



Vandermonde matrix with distinct real bases; see Lemma [2j However, this construction 



must be modified to use it in the proof of the real version of Theorem [T5| since such Vander- 
monde matrices will not be tight; see Theorem |3] Given an M x full spark frame F, the 
modification G := (^FF*)~^^'^F is full spark and Parseval; indeed, GG* = Im, and since F is 
a frame, (_F_F*)~i/2 is full rank, and so the columns of G are linearly independent precisely 
when the corresponding columns of F are linearly independent. 



Another way that the proof of Theorem 15 changes in the real case is in verifying that the 



real Stiefel manifold is irreducible. Just as in the complex case, this follows from Theorem [14 
since St(M, A^) is connected |37], but the fact that St(M, A^) is connected in the real case 
is not immediate. By analogy, the orthogonal group O(A^) certainly acts transitively on 
St(M, A^), but unlike the unitary group, the orthogonal group has two connected components. 
Intuitively, this is resolved by the fact that A^ > M, granting additional freedom of movement 
throughout St(M, A^). 



In addition to Theorems 13 and 15, we would like a similar result for unit norm tight 



frames, i.e., that every unit norm tight frame is arbitrarily close to a full spark unit norm 
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tight frame. Certainly, the set of unit norm tight frames is a real algebraic variety, but 
it is unclear whether this variety is irreducible. Without knowing whether the variety is 



irreducible, we can follow the proofs of Theorems 13 and 15 to conclude that full spark unit 
norm tight frames are dense in the irreducible components in which they exist — a far cry from 
the density result we seek. This illustrates a significant gap in our current understanding of 
the variety of unit norm tight frames. It should be mentioned that Strawn |12] showed that 
the variety of M x unit norm tight frames (over real or complex space) is nonsingular 
precisely when M and are relatively prime. Additionally, Dykema and Strawn |17j proved 
that the variety of 2 x A^ real unit norm tight frames is connected, and so by Theorem Il4l 



this variety is irreducible when A^ is odd. It is unknown whether the variety of M x A^ unit 



norm tight frames is connected in general. Finally, we note that a Theorem 15 gives a weaker 
version of the result we would like: every unit norm tight frame is arbitrarily close to a full 
spark tight frame with frame element lengths arbitrarily close to 1. 

4. The computational complexity of verifying full spark 

In the previous section, we demonstrated the abundance of full spark frames, even after 
imposing the additional condition of tightness. But how much computation is required to 
check whether a particular frame is full spark? At the heart of the matter is computational 
complexity theory, which provides a rigorous playing field for expressing how hard certain 
problems are. In this section, we consider the complexity of the following problem: 

Problem 16 (FULL Spark). Given a matrix, is it full spark? 

For the lay mathematician. Full Spark is "obviously" NP-hard because the easiest way 
he can think to solve it for a given M x N matrix is by determining whether each of the 
M X M submatrices is invertible; computing (j^) determinants would do, but this would take 
a lot of time, and so Full Spark must be NP-hard. However, computing (^) determinants 
may not necessarily be the fastest way to test whether a matrix is full spark. For example, 
perhaps there is an easy-to-calculate expression for the product of the determinants; after all, 
this product is nonzero precisely when the matrix is full spark. Recall that Theorem [9] gives 
a very straightforward litmus test for FuLL Spark in the special case where the matrix is 
formed by rows of a DFT of prime-power order — who's to say that a version of this test does 
not exist for the general case? If such a test exists, then it would suffice to find it, but how 
might one disprove the existence of any such test? Indeed, since we are concerned with the 
necessary amount of computation, as opposed to a sufficient amount, the lay mathematician's 
intuition is a bit misguided. 

To discern how much computation is necessary, the main feature of interest is a problem's 
complexity. We use complexity to compare problems and determine whether one is harder 
than the other. As an example of complexity, intuitively, doubling an integer is no harder 
than adding integers, since one can use addition to multiply by 2; put another way, the 
complexity of doubling is somehow "encoded" in the complexity of adding, and so it must 
be lesser (or equal). To make this more precise, complexity theorists use what is called a 
polynomial-time reduction, that is, a polynomial-time algorithm that solves problem A by 
exploiting an oracle which solves problem B; the reduction indicates that solving problem 
A is no harder than solving problem B (up to polynomial factors in time), and we say "A 
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reduces to B" or A < B. Since we can use the polynomial-time routine a; + a; to produce 
2x, we conclude that doubling an integer reduces to adding integers, as expected. 

In complexity theory, problems are categorized into complexity classes according to the 
amount of resources required to solve them. For example, the complexity class P contains 
all problems which can be solved in polynomial time, while problems in EXP may require 
as much as exponential time. Problems in NP have the defining quality that solutions can 
be verified in polynomial time given a certificate for the answer. As an example, the graph 
isomorphism problem is in NP because, given an isomorphism between graphs (a certificate), 
one can verify that the isomorphism is legit in polynomial time. Clearly, P C NP, since we 
can ignore the certificate and still solve the problem in polynomial time. Finally, a problem 
B is called NP-hard if every problem A in NP reduces to B, and a problem is called NP- 
complete if it is both NP-hard and in NP. In plain speak, NP-hard problems are harder than 
every problem in NP, while NP-complete problems are the hardest of problems in NP. 

At this point, it should be clear that NP-hard problems are not merely problems that seem 
to require a lot of computation to solve. Certainly, NP-hard problems have this quality, as 
an NP-hard problem can be solved in polynomial time only if P = NP; this is an open 
problem, but it is widely believed that P 7^ NP. However, there are other problems which 
seem hard but are not known to be NP-hard (e.g., the graph isomorphism problem). Rather, 
to determine whether a problem is NP-hard, one must find a polynomial-time reduction that 
compares the problem to all problems in NP. To this end, notice that A < B and B < C 
together imply A < C, and so to demonstrate that a problem C is NP-hard, it suffices to 
show that B < C for some NP-hard problem B. 

Unfortunately, it can sometimes be difficult to find a deterministic reduction from one 
problem to another. One example is reducing the satisfiability problem (SAT) to the unique 
satisfiability problem (Unique SAT). To be clear, SAT is an NP-hard problem [28] that 
asks whether there exists an input for which a given Boolean function returns "true," while 
Unique SAT asks the same question with an additional promise: that the given Boolean 
function is satisfiable only if there is a unique input for which it returns "true." Intuitively, 
Unique SAT is easier than SAT because we might be able to exploit the additional struc- 
ture of uniquely satisfiable Boolean functions; thus, it could be difficult to find a reduction 
from SAT to Unique SAT. Despite this intuition, there is a randomized polynomial-time 
reduction from SAT to Unique SAT |18]. Defined over all Boolean functions of n vari- 
ables, the reduction maps functions that are not satisfiable to other functions that are not 
satisfiable, and with probability > g^, it maps satisfiable functions to uniquely satisfiable 
functions. After applying this reduction to a given Boolean function, if a Unique SAT 
oracle declares "uniquely satisfiable," then we know for certain that the original Boolean 
function was satisfiable. But the reduction will only map a satisfiable problem to a uniquely 
satisfiable problem with probability > g^, so what good is this reduction? The answer lies 
in something called amplification; since the success probability is, at worst, polynomially 
small in n (i.e., we can repeat our oracle-based randomized algorithm a polynomial 

number of times np{n) and achieve an error probability < (1 — j^)"^*^""* ~ e"" which is 
exponentially small. 
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In this section, we give a randomized polynomial-time reduction from a problem in ma- 
troid theory. Before stating the problem, we first briefly review some definitions. To each 
bipartite graph with bipartition {E,E'), we associate a transversal matroid {E,X), where X 
is the collection of subsets of E whose vertices form the ends of a matching in the bipartite 
graph; subsets in X are called independent. Hall's marriage theorem j2l] gives a remarkable 
characterization of the independent sets in a transversal matroid: 5 G X if and only if every 
subset A C B has > \A\ neighbors in the bipartite graph. Next, just as spark is the size of 
the smallest linearly dependent set, the girth of a matroid is the size of the smallest subset of 
E that is not in X. In fact, this analogy goes deeper: A matroid is representable over a field 
¥ if, for some M, there exists a mapping ip: E F'^'^ such that f{A) is linearly independent 
if and only if A G X; as such, the girth of {E,X) is the spark of <f{E). In our reduction, we 
make use of the fact that every transversal matroid is representable over M [35] . We are now 
ready to state the problem from which we will reduce Full Spark: 

Problem 17. Given a bipartite graph, what is the girth of its transversal matroid? 



Before giving the reduction, we will show that Problem [T7| is NP-hard. The result comes 
from McCormick's thesis [31], which credits the proof to Stockmeyer; since [31] is difficult 
to access and the proof is instructive, we include it below: 



Theorem 18. Problem 11 is HP-hard. 



Proof. We will reduce from the NP-complete clique decision problem, which asks "Given a 
graph, does it contain a clique of K vertices?" [2B]- First, we may assume K > A without loss 
of generality, since any such clique can otherwise be found in cubic time by an exhaustive 
search. Take a graph G = (y,E), and consider the bipartite graph G' between disjoint sets 
E and ^ U {1, . . . , — K — 1}, in which e v for every e G E and f G e, and e k 
for every e G E and k G {1, . . . , (^) — K — 1}. We claim that the girth of the transversal 
matroid of G' is (^) precisely when there exists a i^-clique in G. 

We start by analyzing the girth of a transversal matroid. Consider any dependent set 
C ^ E with < |C| — 2 neighbors in G'. Then removing any member x of C will produce 
a smaller set C \ {x} with < |C| — 2 = |C \ {x}\ — 1 neighbors in G', which is necessarily 
dependent by the pigeonhole principle. Now consider any dependent set C C with > |C| 
neighbors in G'. By the Hall's marriage theorem, there exists a proper subset C ^ C with 
< \C'\ neighbors in G', meaning C is a smaller dependent set. Thus, the girth c is the size 
of the smallest subset C E with |C| — 1 total neighbors in G'. 

Suppose c = (^). Then since C is adjacent to every vertex in {1, . . . , ) - - 1}, C 
has (c - 1) - ((f) -K -1) = K neighbors in V. These are precisely the vertices D in G 
which are induced by the edges in C, and so C is contained in the set C of edges induced 
by D, of which there are < ('f') = (f), with equality only if D induces a fT-clique in G. 
Since (^) = \C\ < \C'\ < ('■^') = (^) implies equality, there exists a ii'-clique in G. 

Now suppose there exists a i^-clique with edges C. Then C has (f ) elements and K + 
(('^) — K — 1) = (■^2) ^ 1 neighbors in G'. To prove that c = (f), it suffices to show that 
there is no smaller subset C E with \C'\ — 1 total neighbors in G'. Suppose, to the 
contrary, that c = (f ) — i for some i > 0. Then there exists C E with c elements and 
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(c — 1) — ((^) — K ~1) = K — i neighbors in V. Note that each e G E contains two vertices 
in G, and so by the definition of G', G' necessarily has K — i > 2 neighbors in V. Also, 
since the K — i neighbors of G' in V arise from the subgraph of G induced by G', and since 
those neighbors induce at most {^2^) edges including C", we have {^) —i = c< {^2^)- This 
inequality simplifies to £ > 2K — 3, which combines with K — i > 2 to contradict the fact 
that i>0. □ 



Having established that Problem [T7] is NP-hard, we reduce from it the main problem of 
this section. Our proof is specifically geared toward the case where the matrix in question 
has integer entries; this is stronger than manipulating real (complex) numbers exactly as 
well as with truncations and tolerances. 

Theorem 19. Full Spark is hard for NP under randomized polynomial-time reductions. 



Proof. We will give a randomized polynomial-time reduction from Problem 17 to Full 
Spark. As such, suppose we are given a bipartite graph G, in which every edge is between 
the disjoint sets A and B. Take M := \B\ and N := 1^41 . Using this graph, we randomly draw 
an M X matrix F using the following process: for each i E B and j G A, pick the entry 
Fij randomly from {1, . . . , A^2^+^} if i ^ j in G; otherwise set Fij = 0. In Proposition 3.11 
of [30], it is shown that the columns of F form a representation of the transversal matroid 
of G with probability > |. For the moment, we assume that F succeeds in representing the 
matroid. 

Since the girth of the original matroid equals the spark of its representation, for each 
K = 1, . . . , M, we test whether Spark(F) > K. To do this, take H to be some M x P 
full spark frame. We will determine an appropriate value for P later, but for simplicity, we 
can take H to be the Vandermonde matrix formed from bases {1, . . . ,-P}; see Lemma [2] 
We claim we can randomly select K indices /C C {1, . . . , P} and test whether H^F is full 
spark to determine whether Spark(F) > K. Moreover, after performing this test for each 
K = 1,. . . , M, the probability of incorrectly determining Spark(F) is < |, provided P is 
sufficiently large. 

We want to test whether H^F is full spark and use the result as a proxy for whether 
Spark(F) > K. For this to work, we need to have R.ank{H^Fic') = K precisely when 
Rank(Fx:') = K for every /C' C {1, . . . , A^} of size K. To this end, it suffices to have the 
nuUspace M{H^) of intersect trivially with the column space of F^^i for every K,' . To be 
clear, it is always the case that Rank(if^Fj<;/) < Rank(F/(;/), and so Rank(Fx:') < K implies 
Rank(if^Fj<;') < K. If we further assume that Af{H^) fl Span(F/c') = {0}, then the converse 
also holds. To see this, suppose Rank(if^F/c') < K. Then by the rank-nullity theorem, 
there is a nontrivial x G Af{H^Fic'). Since H^Ffc'X = 0, we must have F/c'X G Af{H^), 
which in turn implies x G Af{F/c') since M{H^) fl Span(Fyc') = {0} by assumption. Thus, 
Rank(Fyc/) < A' by the rank- nullity theorem. 

Now fix /C' C {1, . . . , A^} of size K such that Rank(F/c') = K. We will show that the vast 
majority of choices /C C {1, . . . , P} of size K satisfy Af{H^) fl Span(Px:') = {0}- To do this, 
we consider the columns {hk}keK: of Hjc one at a time, and we make use of the fact that 
Af{H^) = Clk^jc-^ihl). In particular, since H is full spark, there are at most M — K columns 
of H in the orthogonal complement of Span(Px:')5 so there are at least P — (M — K) 
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choices of /i^^ for which jV{hl_^) does not contain Span(F/t;'), i.e., 

dim(^A/'(/i*J n Span(F/cO) = K - I. 

Similarly, after selecting the first J /i^'s, we have dim(5') = K — J, where 

J 

S:=f]Ar{hl^)nSp^n{F^>). 

i=i 

Again, since H is full spark, there are at most M — [K — J) columns of H in the orthogonal 
complement of S", and so the remaining P — (M — {K — J)) columns are candidates for h^jj^^ 
that give 

/ J+l N 

dim( fl M{hl) n Span(F^O = A\m{M{hl,J n S) = K - {J + I). 

Overall, if we randomly pick /C C {1, . . . , P} of size K, then 

Pr(Ar(i7*) n Span(F^O = {0}) > (1 - ^)(1 - ^^^^) ■ ■ ■ (1 - ^) 

>(l-f)^ 
— ^ pi 

where the final step is by Bernoulli's inequality. Taking a union bound over all choices of 
^ {1) • • • ) and all values of -ft" = 1, . . . , M then gives 

^ ^ /A^\ Mir 



K 



K=l 

< . 

- P 

Thus, to make the probability of failure < |, it suffices to have P = M^2^+^. 

In summary, we succeed in representing the original matroid with probability > ^, and 
then we succeed in determining the spark of its representation with probability > ^. The 
probability of overall success is therefore > |. Since our success probability is, at worst, 
polynomially small, we can apply amplification to achieve an exponentially small error prob- 
ability. □ 

Our use of random linear projections in the above reduction to Full Spark is simi- 
lar in spirit to Valiant and Vazirani's use of random hash functions in their reduction to 
Unique SAT [IS]. Since their randomized reduction is the canonical example thereof, we 
find our reduction to be particularly natural. 

As a final note, we clarify that Theorem 19 is a statement about the amount of computation 
necessary in the worst case. Indeed, the hardness of Full Spark does not rule out the 
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existence of smaller classes of matrices for which full spark is easily determined. As an 
example, Theorem |9] determines Full Spark in the special case where the matrix is formed 
by rows of a DFT of prime-power order. This illustrates the utility of applying additional 
structure to efficiently solve the Full Spark problem, and indeed, such classes of matrices 
are rather special for this reason. 
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